A Semi-Supervised Adaptive Matrix Machine Approach for Fault Diagnosis in Railway Switch Machine

The switch machine, an essential element of railway infrastructure, is crucial to the safety of railway operations. Traditional fault diagnosis methods are constrained by their dependence on extensive labeled datasets. Semi-supervised learning (SSL), although a promising remedy for the scarcity of labeled samples, faces challenges such as pseudo-label imbalance and inadequate data representation. In response, this paper presents the Semi-Supervised Adaptive Matrix Machine (SAMM) model for fault diagnosis of switch machines. SAMM combines semi-supervised learning with adaptive techniques: an adaptive low-rank regularizer captures the correlations between the rows and columns of matrix data, and an adaptive penalty term corrects imbalances across sample categories. The model progressively enlarges its labeled dataset using probabilistic outputs within a semi-supervised framework, automatically adjusting its parameters to accommodate diverse data distributions and structures. The SAMM optimization problem is solved with the alternating direction method of multipliers (ADMM). Experiments on a dataset of current signals from switch machines indicate that SAMM outperforms existing baseline models, demonstrating strong state diagnosis capability when labeled samples are scarce. SAMM thus offers an effective approach to semi-supervised classification tasks involving matrix data.


Introduction
The turnout is one of the three primary outdoor railway components, and its condition directly affects the safety of shunting and station traffic [1]. The switch machine, which serves as the motion execution unit of the turnout, locks the track in either the normal or the reverse position, as shown in Figure 1. Currently, turnout equipment is maintained mainly through a combination of "cycle repair" and "fault repair" [2]. Maintenance personnel obtain the action current and power data of the switch machine through the centralized signaling monitoring (CSM) system and analyze the working status of the turnouts based on their professional knowledge and experience, thereby aiding turnout maintenance. However, the conversion signals of switch machine equipment are non-stationary and non-linear, and the numerous types of turnout faults, each with complex characteristics, make fault detection and classification very challenging. This maintenance approach has several drawbacks, including long fault delays, low fault diagnosis accuracy, and high labor intensity. It also often leads to "under-maintenance" and "over-maintenance," highlighting the limitations of the current maintenance model. Fault diagnosis of the switch machine is crucial to guaranteeing reliability over the entire life cycle of the interlocking and has attracted extensive attention from experts and scholars in the field of condition-based maintenance [3].
With the advancement of intelligent operation and maintenance of railway electrical services, an increasing number of data-driven fault diagnosis methods have emerged for switch machines and other electromechanical equipment [4,5]. These methods fall into three types according to their feature classification strategy: (1) distance-based methods, which determine abnormality by comparison with standard curves [6,7]; (2) classifier-based methods, including support vector machine (SVM) [8], k-nearest neighbor (KNN) [9], and random forest (RF) [10]; and (3) deep-learning-based methods, including convolutional auto-encoder (CAE) [11], convolutional neural networks (CNN) [12], and long short-term memory networks (LSTM) [13]. However, these methods demand substantial professional and field-specific expertise, and obtaining labeled data is both laborious and expensive, which hinders intelligent fault diagnosis [14,15].
Since semi-supervised learning (SSL) requires only a small amount of labeled data, numerous studies have begun incorporating SSL to enhance fault diagnosis performance [16,17]. Lao et al. [18] introduced a semi-supervised weighted prototype network (SSWPN) for switch machine fault diagnosis with limited labels. Shi et al. [19] extracted dynamic current profile features and integrated SVM with semi-supervised strategies to ascertain the turnout state. In semi-supervised learning, pseudo-labeling is an essential strategy. The model is first trained on a limited set of labeled examples. It then assigns labels to a large volume of unlabeled data based on its predictions. Predictions made with high confidence are treated as accurate labels and are incorporated into further training of the model [20]. Although some studies have introduced semi-supervision into fault diagnosis, semi-supervised fault diagnosis for switch machines remains nascent and faces several challenges. (1) Even when the training data and the target data are balanced, an inherent imbalance in pseudo-labels emerges due to data similarity [21]. In classification models, the loss penalty plays a crucial role in positioning the separating hyperplane. Applying a single penalty parameter to all samples shifts the hyperplane when class distributions are unbalanced, because the model favors the majority class at the expense of the minority classes. (2) Acquired signals are naturally represented as fault images, whose matrix rows and columns exhibit varied correlations. Training solely on vector features, which rely on expert knowledge, inevitably compromises spatial feature integrity because the matrix samples must be vectorized.
Several scholars have studied how to use matrix samples directly for modeling. Capturing the rank of a matrix, a measure of the correlation between its rows and columns, is crucial for constructing a matrix classifier. Researchers have proposed various approaches to the matrix rank issue, including decomposing the matrix into rank-k matrices [22,23] or constraining the matrix rank to 1 [24]. However, these techniques often require the matrix rank to be predetermined, which limits their practicality. In contrast, support matrix machines (SMM) [25] employ the nuclear norm to approximate the matrix rank, enabling direct classification of 2D matrix features without presetting the rank and thereby preserving the data's structural integrity. In recent years, efforts have been made to enhance SMM's performance through variants such as the multi-class support matrix machine (MSMM) [26], the multi-class support matrix machine based on evolutionary optimization (MSMM-CE) [27], and the security transfer support matrix machine (STSMM) [28]. Recent studies have shown that approximating the rank with the nuclear norm is suboptimal [29,30]. The nuclear norm is the sum of the singular values, whose magnitudes indicate their significance. Using it to approximate the matrix rank penalizes every singular value with the same weight and lacks adaptiveness, which significantly reduces its flexibility [31].
This study introduces a semi-supervised adaptive matrix machine (SAMM) method for diagnosing faults in switch machines and applies it to current signals from a ZDJ9 switch machine dataset. The diagnostic capability of the method is validated through experimental comparisons, which show that it outperforms other contemporary diagnostic approaches. The primary contributions of this research are threefold:
(1) An adaptive low-rank regularizer selectively retains larger singular values, improving the approximation of the matrix rank and enhancing the extraction of fundamental connections between the rows and columns of matrix data.
(2) A probabilistic output strategy for SAMM is developed and coupled with a semi-supervised learning (SSL) framework that uses these outputs to assign high-confidence pseudo-labels to unlabeled samples, effectively mitigating the lack of labeled data.
(3) An adaptive penalty term is introduced to address the imbalance in pseudo-label distribution; it adjusts the hinge loss penalty coefficient based on sample quantity to counteract learning biases.
The remainder of this paper is organized as follows: Section 2 offers a concise overview of the original SMM model. Section 3 describes the proposed SAMM model and its diagnostic framework in detail. Section 4 discusses the experimental validation of the method. Section 5 concludes the paper with a summary of the findings.

Support Matrix Machine
The support matrix machine (SMM) is a classification method designed specifically for input data in matrix form, as shown in Figure 2. Unlike conventional classification methods, which convert matrices into vectors and potentially compromise the structural integrity of the data, SMM preserves the matrix format. This enables SMM to fully utilize the structural information within matrices. To exploit this advantage, it introduces a penalty term, the spectral elastic net. By maintaining the matrix structure, SMM captures the inherent structure and correlations of the data, thereby enhancing classification performance.
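The objective function of SMM is formulated as follows:

$$\min_{W,b}\ \frac{1}{2}\operatorname{tr}\left(W^{T}W\right) + \beta\,\|W\|_{*} + \rho\sum_{i=1}^{N}\left\{1 - y_i\left[\operatorname{tr}\left(W^{T}A_i\right) + b\right]\right\}_{+} \qquad (1)$$

where $A_i \in \mathbb{R}^{p \times q}$, $i = 1, 2, \ldots, N$, are the training matrices and $y_i \in \{-1, 1\}$ are the corresponding labels. $W \in \mathbb{R}^{p \times q}$ is the regression matrix, whose norm is inversely proportional to the width of the hyperplane margin, $b$ is the bias, and $\beta$ and $\rho$ are two hyperparameters.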
The SMM objective function comprises two principal components: the matrix-form hinge loss and the spectral elastic net penalty. Hinge loss, a common feature in classification models, promotes sparsity and robustness. In SMM, this loss measures the classification error as the discrepancy between the model's predictions and the actual labels for each training sample. The spectral elastic net penalty exploits the structural information of the feature matrix, capturing correlations within its columns and rows. In particular, $\operatorname{tr}(W^{T}W)$ keeps the model's complexity under control and avoids overfitting, in keeping with the principle of structural risk minimization. The nuclear norm $\|W\|_{*}$, defined as the sum of the singular values of the matrix, serves as a measure of the matrix's low rank, which facilitates the extraction of structural information from matrix data [32]. However, the nuclear norm is only a relaxed approximation of the rank function and does not fully capture the matrix's true rank, so minimizing it in place of the rank may compromise accuracy, especially for matrices with complex structures. The nuclear norm deviates significantly from the true rank when the singular values differ from 1 [33]. This relaxation causes the nuclear norm to over-penalize the matrix, potentially yielding a suboptimal low-rank approximation.
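To make the two components concrete, the following minimal sketch evaluates an SMM-style objective for given parameters. It assumes the standard form with a Frobenius term, a nuclear norm weighted by β, and a hinge loss weighted by ρ; the function name `smm_objective` and the toy inputs are illustrative assumptions, not taken from the paper.

```python
import numpy as np

def smm_objective(W, b, A, y, beta, rho):
    """Evaluate 0.5*tr(W^T W) + beta*||W||_* + rho * sum of hinge losses."""
    # tr(W^T W) equals the sum of squared entries of W.
    frob = 0.5 * np.sum(W * W)
    # Nuclear norm: sum of the singular values of W.
    nuclear = np.linalg.svd(W, compute_uv=False).sum()
    # Decision values f_i = tr(W^T A_i) + b for a stack A of shape (N, p, q).
    scores = np.tensordot(A, W, axes=([1, 2], [0, 1])) + b
    # Matrix-form hinge loss: only samples inside the margin contribute.
    hinge = np.maximum(0.0, 1.0 - y * scores).sum()
    return frob + beta * nuclear + rho * hinge

# Toy example (hypothetical data): two 2 x 2 samples with opposite labels.
W = np.diag([2.0, 1.0])
A = np.stack([np.eye(2), -np.eye(2)])
y = np.array([1.0, -1.0])
value = smm_objective(W, 0.0, A, y, beta=0.5, rho=1.0)
```

In this toy case both samples lie outside the margin, so the hinge term vanishes and the value is determined by the spectral elastic net penalty alone.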

Semi-Supervised Adaptive Matrix Machine
In this section, the proposed semi-supervised adaptive matrix machine (SAMM) method is presented. It aims to address the challenges of insufficient labeled samples and pseudo-label imbalance and to capture the correlations between matrix rows and columns more accurately. To resolve the non-smooth optimization problem within the SAMM model, an alternating update strategy is employed [34], facilitating the efficient solution of the model and the attainment of the optimal solution.

SAMM Model
The semi-supervised adaptive matrix machine (SAMM) combines semi-supervised learning with adaptive techniques, using an adaptive low-rank regularizer to identify correlations within matrix data and an adaptive penalty term to mitigate the impact of imbalanced inter-class samples on hyperplane margins, as shown in Figure 3. By integrating probabilistic outputs, which incrementally expand the labeled sample set, SAMM gradually improves classification performance and also adjusts its model parameters autonomously during learning to suit various data distributions and feature structures. The objective function of SAMM is as follows:

$$\min_{W,b}\ \frac{1}{2}\operatorname{tr}\left(W^{T}W\right) + \beta\sum_{k=1}^{r}\log\left|\sigma_k + \varepsilon\right| + \sum_{i=1}^{N}\rho_i\left\{1 - y_i\left[\operatorname{tr}\left(W^{T}A_i\right) + b\right]\right\}_{+} \qquad (2)$$

The first term influences the model in the same way as in the support matrix machine (SMM). The second term, $\beta\sum_{k=1}^{r}\log|\sigma_k + \varepsilon|$, is the adaptive low-rank regularizer, where $\sigma_k$ is the $k$-th singular value of $W$ and $\varepsilon$ is a sufficiently small positive number that ensures $\sigma_k + \varepsilon$ is not zero. Larger singular values correspond to row and column information within the matrix and should be preserved, whereas smaller singular values, which are often linked to irrelevant or redundant data, should be discarded. The adaptive low-rank regularizer maintains the larger singular values and reduces the smaller ones to zero or near-zero values. By minimizing this regularizer, SAMM extracts low-rank matrix information and adaptively selects and preserves the singular values associated with highly correlated data. This allows SAMM to estimate the matrix rank more accurately, extract strong correlations between rows and columns, handle matrix-form data more effectively, and improve its classification performance.

The adaptive penalty term computes category weights according to the ratio of pseudo-labeled samples per category and adjusts the hinge loss penalty parameter $\rho_i$ based on these weights to handle unbalanced samples. If the samples are balanced, the penalty remains constant, $\rho_i = \rho$; for unbalanced samples, $\rho_i$ is set from the class sample counts, where $N_1$ and $N_2$ denote the numbers of samples in the majority and minority categories, respectively, and $N$ denotes their combined total. With the adaptive penalty term, samples from the majority categories incur a lower penalty than those from the minority categories. Consequently, the SAMM model considers the features of all categories within unbalanced datasets and avoids overemphasizing the majority categories while neglecting the minority ones.
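The difference between the nuclear norm and the logarithmic regularizer can be illustrated numerically. The sketch below is a hedged illustration rather than the paper's implementation: the helper names, the value of `eps`, and the toy matrix are assumptions. It compares the local penalty rate each regularizer applies to every singular value; the nuclear norm applies a rate of 1 everywhere, while the log term applies $1/(\sigma_k + \varepsilon)$.

```python
import numpy as np

def nuclear_norm(sigma):
    """Sum of singular values: every value is penalized at the same rate."""
    return float(np.sum(sigma))

def log_sum_penalty(sigma, eps=1e-3):
    """Sum of log(sigma_k + eps), the logarithmic low-rank surrogate."""
    return float(np.sum(np.log(sigma + eps)))

def shrinkage_weights(sigma, eps=1e-3):
    """Derivative of log(sigma_k + eps): the penalty rate per singular value."""
    return 1.0 / (sigma + eps)

# Toy matrix (hypothetical) with one dominant, one moderate, one tiny singular value.
M = np.diag([10.0, 1.0, 0.01])
sigma = np.linalg.svd(M, compute_uv=False)  # returned in descending order
weights = shrinkage_weights(sigma)
```

The weights grow as the singular values shrink, so small singular values are pushed toward zero much harder than large ones, which matches the adaptive behavior described above.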
To recognize switch machine fault status with a limited number of labeled samples, we leverage both a small set of labeled samples and a substantial pool of unlabeled samples. We have developed a semi-supervised model that integrates probabilistic outputs on top of the SAMM model. Initially, the model is trained on the labeled dataset $A_l = \{A_i, y_i\}_{i=1}^{N_l}$. The Platt scaling method [35] maps the output of the SAMM model for each sample into the interval [0, 1], serving as a probability estimate of the sample's category membership. Using the Wu method [36], the $C(C-1)/2$ pairwise SAMM probability estimates are coupled into a single value, and the maximum probability output is taken as the confidence that the unlabeled sample belongs to the corresponding category. Confidence thresholding is a widely used technique for improving pseudo-labeling: a higher threshold improves the reliability of the pseudo-labels [37]. Given a large unlabeled dataset $A_u = \{A_j\}_{j=1}^{N_u}$, samples that exceed a specified confidence threshold are incorporated into the labeled dataset as reliable samples $A_r = \{\tilde{A}_j, \tilde{y}_j\}_{j=1}^{N_r}$. This process is repeated until no samples exceed the confidence threshold or the maximum number of iterations is reached. This semi-supervised learning approach addresses switch machine fault diagnosis with few labeled samples and reduces the time and cost of sample labeling.
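The selection step of this procedure can be sketched as follows. This is a minimal illustration under stated assumptions: the sigmoid parameters `a` and `b` stand for values that Platt scaling would fit on held-out data, the class-probability matrix `probs` is assumed to be already available (the pairwise coupling step is not implemented here), and all function names are hypothetical.

```python
import numpy as np

def platt_probability(score, a, b):
    """Platt scaling: map a decision value to (0, 1) with a fitted sigmoid."""
    return 1.0 / (1.0 + np.exp(a * score + b))

def select_pseudo_labels(probs, threshold):
    """Return indices and labels of samples whose top probability exceeds threshold."""
    confidence = probs.max(axis=1)   # confidence of the most likely category
    labels = probs.argmax(axis=1)    # candidate pseudo-label per sample
    keep = np.flatnonzero(confidence > threshold)
    return keep, labels[keep]

# Toy class probabilities (hypothetical) for three unlabeled samples, three categories.
probs = np.array([[0.96, 0.03, 0.01],
                  [0.40, 0.35, 0.25],
                  [0.05, 0.92, 0.03]])
keep, pseudo = select_pseudo_labels(probs, threshold=0.9)
```

In a full self-training loop, the selected samples would be moved from the unlabeled pool to the labeled set, the model retrained, and the selection repeated until no sample passes the threshold or an iteration limit is reached.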

SAMM Learning Algorithm
Solving the SAMM model is a non-smooth optimization problem, which complicates the search for a globally optimal solution. To address this issue, the alternating direction method of multipliers (ADMM) is adopted. ADMM splits the original problem into two subproblems and solves them through alternating iterative updates; in each iteration, it moves toward the solution by updating the primal and dual variables. This procedure solves the SAMM model efficiently, yielding a solution characterized by low rank and sparsity. In the ADMM framework, Equation (2) is reformulated as Equation (4):

$$\arg\min_{W,b,Z}\ H(W,b) + G(Z) \quad \text{s.t.}\quad Z - W = 0 \qquad (4)$$

Here $H(W,b) = \frac{1}{2}\operatorname{tr}(W^{T}W) + \sum_{i=1}^{N}\rho_i\{1 - y_i[\operatorname{tr}(W^{T}A_i) + b]\}_{+}$ and $G(Z) = \beta\sum_{k=1}^{r}\log|\sigma_k(Z) + \varepsilon|$, with $\sigma_k(Z)$ the $k$-th singular value of $Z$. The augmented Lagrangian function is then defined as

$$L(W,b,Z,\Lambda) = H(W,b) + G(Z) + \operatorname{tr}\left[\Lambda^{T}(Z - W)\right] + \frac{\delta}{2}\|Z - W\|_{F}^{2} \qquad (5)$$

where $\delta > 0$ is the step size and $\Lambda$ is the Lagrange multiplier. Following the ADMM framework, the objective function is divided into two subproblems (in $Z$ and in $W$) that are solved iteratively. In each iteration, the solver minimizes over $Z$ and then over $W$, and afterwards updates the Lagrange multiplier accordingly:

$$Z^{t+1} = \arg\min_{Z} L(W^{t}, b^{t}, Z, \Lambda^{t}), \qquad (W^{t+1}, b^{t+1}) = \arg\min_{W,b} L(W, b, Z^{t+1}, \Lambda^{t}), \qquad \Lambda^{t+1} = \Lambda^{t} + \delta\left(Z^{t+1} - W^{t+1}\right)$$

Here $t$ and $t+1$ denote the $t$-th and $(t+1)$-th iterations, respectively.
(1) To solve the subproblem for $Z$, $(W, b)$ and $\Lambda$ are held fixed, which reduces Equation (5) to a function of $Z$ alone. Let $I = W - \Lambda/\delta$ and take its singular value decomposition (SVD), $I = U\Sigma_I V^{T}$. According to [31], $Z$ is then obtained by applying the proximal operator of the adaptive low-rank regularizer to the singular values of $I$.
(2) To solve the subproblem for $W$, we minimize all terms of Equation (5) that involve $W$. The Lagrangian of this subproblem is constructed with the Lagrange multiplier method, and its partial derivatives with respect to $b$ and $\beta_i$ are set to zero. Substituting Equation (12) into Equation (11) gives Equation (13); differentiating Equation (13) with respect to $W$ and setting the derivative to zero gives Equation (14). Reinserting Equations (12) and (14) into Equation (10) yields the optimization problem for $\alpha$. The optimal value of $b$ is then determined as an average over the set $Z^{*} = \{i : 0 < \alpha_i < C\}$. Algorithm 1 outlines the proposed learning algorithm for SAMM.
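As an illustration of the Z-step, the sketch below shrinks the singular values of $I = W - \Lambda/\delta$ with a scalar proximal map for the logarithmic penalty. It is a hedged sketch, not the operator of [31]: it assumes the Z-subproblem has the form $\beta\sum_k \log(\sigma_k + \varepsilon) + \frac{\delta}{2}\|Z - I\|_F^2$, solves the scalar problem by comparing the boundary point 0 with the larger stationary point, and uses hypothetical function names and toy values.

```python
import numpy as np

def prox_log(s, lam, eps=1e-3):
    """Minimize lam*log(x + eps) + 0.5*(x - s)**2 over x >= 0 for a scalar s >= 0."""
    def cost(x):
        return lam * np.log(x + eps) + 0.5 * (x - s) ** 2
    candidates = [0.0]
    # Stationary points solve x^2 + (eps - s)*x + (lam - s*eps) = 0.
    disc = (s + eps) ** 2 - 4.0 * lam
    if disc >= 0.0:
        root = 0.5 * ((s - eps) + np.sqrt(disc))  # larger root is the local minimum
        if root > 0.0:
            candidates.append(root)
    return min(candidates, key=cost)

def z_update(W, Lam, beta, delta, eps=1e-3):
    """Z-step sketch: shrink the singular values of I = W - Lam/delta."""
    U, s, Vt = np.linalg.svd(W - Lam / delta, full_matrices=False)
    s_new = np.array([prox_log(v, beta / delta, eps) for v in s])
    return (U * s_new) @ Vt  # scale the columns of U, then recombine

# Toy example (hypothetical): one strong and one weak singular value.
W = np.diag([5.0, 0.1])
Z = z_update(W, np.zeros((2, 2)), beta=0.1, delta=1.0)
```

With these toy values the dominant singular value is kept almost unchanged while the weak one is set to zero, so `Z` has rank 1.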

Fault Diagnosis Framework
The comprehensive framework of the proposed model is depicted in Figure 4, with the principal steps summarized as follows:
Step 1: Signal acquisition. Acquire current signals of the switch machine across various fault states.
Step 2: Feature extraction. Convert continuous current signals into 2D matrix samples via downsampling and binarization, enabling efficient processing and model training.
Step 3: Train the SAMM model. Labeled and unlabeled samples from the training dataset are used to build the SAMM model. The model integrates an adaptive low-rank regularizer with an adaptive penalty term, enhancing the extraction of matrix structure information and addressing the pseudo-label imbalance of semi-supervised learning.
Step 4: Test the SAMM model. Predict the switch machine's fault status by inputting test samples into the SAMM model.

Description of the Data Set
The dataset originates from current signals generated by ZDJ9-type switch machines in an urban subway system. The SAMM model uses current signals for fault diagnosis because they correlate directly with the operational status of the railway switch machine. Although vibration and sound signals are also used in switch machine fault diagnosis [38,39], they are harder to collect and interpret because of environmental noise and the need for precise sensor placement. Current signals, on the other hand, can be obtained through the CSM system, so they are readily available, less susceptible to external noise, and can be collected with minimal disruption to the switch machine's operation. The dataset for this study was compiled by CASCO, a professional rail transit control system integrator, at specific stations along Shanghai Metro Line 13. The ZDJ9 switch machine uses a 380 V three-phase AC power supply, with phase currents A, B, and C. This model completes a full state change in approximately 7 to 9 s, and the current signals are sampled at 25 Hz throughout this duration. A typical current profile comprises four principal phases: unlocking, transition, locking, and slow release. For this study, the A-phase current curve was selected for dataset construction because it comprehensively represents the switch machine's motion. As detailed in Table 1 and Figure 5, the A-phase current dataset spans nine distinct statuses, comprising eight fault states and one normal state. Throughout the experiment, the number of labeled training samples per fault status varied from 5 to 30 and the number of unlabeled training samples from 45 to 25, with a constant 50 test samples. Using downsampling and binarization, each raw current signal image was transformed into a 64 × 64 feature matrix for further processing and model training.
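One possible way to turn a sampled current curve into a 64 × 64 binary matrix is sketched below. The paper does not spell out its exact downsampling and binarization steps, so this rasterization scheme, the function name `curve_to_binary_matrix`, and the synthetic signal are all assumptions made for illustration.

```python
import numpy as np

def curve_to_binary_matrix(current, size=64, max_amp=None):
    """Rasterize a 1-D current curve into a size x size binary matrix."""
    current = np.asarray(current, dtype=float)
    # Resample the curve to one value per image column.
    src = np.linspace(0.0, 1.0, num=current.size)
    dst = np.linspace(0.0, 1.0, num=size)
    resampled = np.interp(dst, src, current)
    # Scale amplitudes to row indices; row 0 is the top of the image.
    top = float(resampled.max()) if max_amp is None else float(max_amp)
    top = top if top > 0 else 1.0
    rows = np.round((1.0 - resampled / top) * (size - 1))
    rows = np.clip(rows, 0, size - 1).astype(int)
    image = np.zeros((size, size), dtype=np.uint8)
    image[rows, np.arange(size)] = 1  # binarization: curve pixels are 1
    return image

# Synthetic stand-in for an 8 s action current sampled at 25 Hz (200 points).
t = np.arange(200) / 25.0
signal = 2.0 + 0.5 * np.sin(t)
matrix = curve_to_binary_matrix(signal)
```

Passing a fixed `max_amp` for all samples would keep the vertical scale comparable across fault states, whereas the default rescales each curve to its own maximum.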

Comparison Experiment
To optimize the classification performance of the SAMM model, three key parameters were tuned in the experiments: the low-rank coefficient $\beta$, the loss penalty coefficient $\rho$, and the step size $\delta$. In the experiment, $\delta$ was set to 0.01. Five-fold cross-validation was used to select the structural parameters $\beta$ and $\rho$ from the set $\{2^{-5}, 2^{-4.5}, \ldots, 2^{5}\}$, and the confidence threshold $\theta$ was chosen from $\{0.5, 0.55, \ldots, 0.95\}$. To ensure fair and comparable results, the structural parameters of each model were optimized before the fault diagnosis tasks were run. The same parameter optimization process was applied to the comparative models, namely support vector machines (SVM), support matrix machines (SMM), and multi-class support matrix machines (MSMM): their optimal parameters were determined by 5-fold cross-validation, with value ranges taken from the relevant literature. Structural parameters for the deep learning models, the convolutional auto-encoder (CAE) and the convolutional neural network (CNN), were selected based on the relevant literature. All diagnostic models were run under Windows 11 (64-bit) with Matlab 2023a, on a PC with an Intel(R) Core(TM) i7-13700H CPU and 32.0 GB of RAM.
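The search space described above can be written out explicitly. The sketch below only builds the candidate grids and a 5-fold split; it does not train any model, and the helper `k_fold_indices`, the random seed, and the sample count of 45 are illustrative assumptions. The experiments themselves were run in Matlab, so this Python version is purely a companion illustration.

```python
import itertools
import numpy as np

# Candidate values 2^-5, 2^-4.5, ..., 2^5 for beta and rho (21 values each).
param_values = 2.0 ** np.linspace(-5.0, 5.0, 21)
# Candidate confidence thresholds 0.50, 0.55, ..., 0.95 (10 values).
thresholds = np.round(np.linspace(0.5, 0.95, 10), 2)

def k_fold_indices(n_samples, k=5, seed=0):
    """Split shuffled sample indices into k nearly equal validation folds."""
    rng = np.random.default_rng(seed)
    return np.array_split(rng.permutation(n_samples), k)

# Every (beta, rho) pair that cross-validation would score.
grid = list(itertools.product(param_values, param_values))
folds = k_fold_indices(45, k=5)
```

Each of the 441 pairs would be scored on the five folds, and the pair with the best average validation score retained.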
To assess classification performance across the classifiers, three evaluation metrics were employed: precision, recall, and F1 score. Precision is the proportion of correctly identified positive-class samples among all samples predicted as positive, reflecting the classifier's accuracy. Recall is the proportion of actual positive-class samples that are correctly identified, indicating the classifier's coverage. The F1 score, the harmonic mean of precision and recall, serves as a single comprehensive measure of the classifier's overall effectiveness. Because precision, recall, and F1 score are defined for binary classification, they were computed for each category and then macro-averaged. For k-category classification, the metrics are generalized from the binary case, as delineated in Table 2, and are defined as follows.
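A minimal sketch of macro-averaged metrics computed from a confusion matrix is given here for orientation. It assumes the common per-category definitions (precision as tp/(tp + fp), recall as tp/(tp + fn), F1 as their harmonic mean) followed by an unweighted average over categories; the function name and the toy confusion matrix are hypothetical, and the paper's Table 2 remains the authoritative definition.

```python
import numpy as np

def macro_metrics(conf):
    """conf[i, j] counts samples of true category i predicted as category j."""
    conf = np.asarray(conf, dtype=float)
    tp = np.diag(conf)
    fp = conf.sum(axis=0) - tp  # predicted as c but belonging elsewhere
    fn = conf.sum(axis=1) - tp  # belonging to c but predicted elsewhere
    # Guard against empty categories to avoid division by zero.
    precision_c = np.divide(tp, tp + fp, out=np.zeros_like(tp), where=(tp + fp) > 0)
    recall_c = np.divide(tp, tp + fn, out=np.zeros_like(tp), where=(tp + fn) > 0)
    denom = precision_c + recall_c
    f1_c = np.divide(2 * precision_c * recall_c, denom,
                     out=np.zeros_like(tp), where=denom > 0)
    # Macro-averaging: every category counts equally.
    return precision_c.mean(), recall_c.mean(), f1_c.mean()

# Toy two-category confusion matrix (hypothetical counts).
conf = [[8, 2], [1, 9]]
precision, recall, f1 = macro_metrics(conf)
```

Because every category contributes equally to the average, poor performance on a rare fault state lowers the macro scores as much as poor performance on a common one.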
Here, $tp_c$, $fp_c$, $fn_c$, and $tn_c$ denote the true positives, false positives, false negatives, and true negatives within category $c$. To ensure reliable results, each method was run 10 times for every sample case, which mitigates bias from random factors and enhances the robustness and credibility of the outcomes. Figure 6 displays the fault diagnosis precision of each model with only five labeled training samples; the SAMM model consistently outperforms the others across all experiments. The results demonstrate that SAMM sustains high diagnostic precision even with very few labeled samples. Comparing SVM with the matrix learning models (SMM, MSMM) shows that leveraging the structural information of images indeed enhances fault diagnosis performance. Because SAMM adaptively leverages image structural information and mitigates the shortage of labeled samples, its overall diagnostic efficacy significantly surpasses that of the comparative models.
Figure 7 presents the confusion matrix for the best diagnostic result of each model. The confusion matrix shows that the SAMM model excels in identifying the various fault statuses. The highest diagnostic accuracies achieved by SVM, CAE, CNN, SMM, MSMM, and SAMM are 56.45%, 82.00%, 84.15%, 83.75%, 85.00%, and 92.02%, respectively. The accuracy of the traditional SVM model trails far behind the other models because it cannot fully leverage the structural information of image data. Although the deep-learning-based CAE and CNN models can extract higher-order image features, the scarcity of labeled samples keeps their accuracy below the desired level. By harnessing the structural features of image data, the matrix learning models SMM and MSMM perform comparably to or better than CAE and CNN, albeit with certain limitations. In contrast, the SAMM model extracts low-rank structural information from matrix samples and addresses category imbalance with adaptive penalty terms, achieving the leading diagnostic accuracy of 92.02%. It delivers superior fault diagnosis performance even with a limited number of labeled samples, which underscores its advantages in recognizing switch machine status.
Across every labeled-sample setting (Figure 8), the SAMM model's diagnostic accuracy surpasses that of the comparative models. With 5, 10, 15, 20, 25, and 30 labeled samples per fault status, the SAMM model achieved average diagnostic accuracies of 89.47%, 90.96%, 93.71%, 97.04%, 98.23%, and 98.80%, respectively, higher than those recorded by the other models. This performance is credited in part to the ADMM-based solver, which facilitates stable convergence and helps realize the model's potential. Crucially, SAMM's integration of an adaptive low-rank regularizer with an adaptive penalty term enables precise extraction of intrinsic low-rank structural information from matrix data and addresses the prevalent issue of category imbalance in semi-supervised learning. The experimental findings indicate that SAMM's adaptive semi-supervised learning approach is particularly effective when labeled samples are limited: even with as few as five labeled samples, SAMM achieves a diagnostic accuracy of 89.47%, whereas the other models exhibit a significant decline in performance. This affirms SAMM's practical utility in addressing the challenge of scarce labeled samples.
Tables 3 and 4 detail the recall and F1 scores, respectively, for each model across varying numbers of labeled samples. Together with the precision results analyzed above (Figure 8), they allow a comprehensive evaluation of the models' overall diagnostic performance. The recall and F1 results show that the SAMM model consistently outperforms all comparison models across the various counts of labeled samples. As the number of labeled samples increases, the recall and F1 scores of all models improve, but SAMM's lead persists. Taking precision, recall, and F1 score together, the SAMM model's diagnostic efficacy surpasses that of the comparative models when labeled samples are limited. This suggests that the adaptive low-rank regularizer and the adaptive penalty term enable the SAMM model to tackle scarce labeled samples and category imbalance, demonstrating robust semi-supervised learning (SSL) capability. Overall, the experiments illustrate that the SAMM model makes effective use of the intrinsic structural information of image data while addressing scarce labeled samples and category imbalance through its semi-supervised learning strategy and adaptive mechanisms, resulting in better switch machine fault diagnosis than the other models.
Despite measurement noise and interference in real-world conditions, the method demonstrated strong fault diagnosis capability in the experimental validation. The results indicate that, even with some noise and interference, SAMM consistently achieves high accuracy in identifying and diagnosing switch machine faults, which underscores its practical applicability in real-world scenarios.

Conclusions
This study proposes the Semi-Supervised Adaptive Matrix Machine (SAMM) model for switch machine fault diagnosis. The SAMM model features an adaptive low-rank regularizer that extracts highly correlated low-rank information from matrix data and identifies correlations between the matrix's rows and columns. It employs a semi-supervised learning framework that incrementally assigns pseudo-labels to unlabeled samples based on high-confidence probabilistic outputs, thereby leveraging unlabeled data. An adaptive penalty term adjusts the loss penalty in response to imbalances in category sample sizes, preventing the model from being overly biased towards the majority class. Experimental validation on the switch machine current signal dataset shows that SAMM surpasses the baseline models in fault diagnosis accuracy. The combination of the adaptive low-rank regularizer and the adaptive penalty term captures the inherent structure of the matrix data, while the semi-supervised framework augments the training data through pseudo-labeling, yielding good classification results even with limited labeled samples.
In practical applications, the SAMM method enhances railway switch machine fault diagnosis through the analysis of current signals recorded by the CSM system. This enables preventive maintenance, reduces dependency on extensive labeled datasets, lowers maintenance costs and time, and improves diagnostic accuracy by minimizing false alarms and missed detections. Additionally, the real-time monitoring capabilities of the CSM system, combined with the SAMM method, facilitate quick response to faults, thereby reducing fault handling time and ensuring the continuity and safety of railway operations.
In future research, we will focus on vibration and sound signals to explore new approaches for multimodal fault diagnosis, aiming to leverage the advantages of integrating multiple sensors. We will also investigate variations of the adaptive low-rank regularizer and extend SAMM's application to fault diagnosis and anomaly detection across diverse fields.

Figure 4. Entire framework of the proposed fault diagnosis approach.

Figure 5. Fault status current curves of ZDJ9 switch machine.


Figure 7. Confusion matrix of the optimal results for each model.

Figure 8 illustrates the fault diagnosis accuracy for each model across varying counts of labeled training samples. The diagnostic accuracy of all models improves to varying extents as the number of labeled training samples increases, consistent with the reliance of machine learning models on the volume of training data.

Figure 8. Fault diagnosis precision under different labeled samples.

Table 1. Fault status phenomena and causes of ZDJ9 switch machine.

Table 3. Recall rate of different models with varying numbers of labeled samples.

Table 4. F1 score of different models with varying numbers of labeled samples.